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The cross section for jets from b quarks produced with a W boson has been measured in pp 
collision data from 1.9 fb _1 of integrated luminosity recorded by the CDF II detector at the 
Tevatron. The W + 6-jets process poses a significant background in measurements of top quark 
production and prominent searches for the Higgs boson. We measure a &-jet cross section of 
2.74 ± 0.27(stat.) ± 0.42(syst.) pb in association with a single flavor of leptonic W boson decay 
over a limited kinematic phase space. This measured result cannot be accommodated in several 
available theoretical predictions. 
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PACS numbers: 14.70.Fm, 14.65.Fy 



The measurement of associated production of a W bo- 
son and one or more jets from b quarks, herein referred 
to as W + 6-jet production, provides an important test 
of quantum chromodynamics (QCD). The understanding 
of this process and its description by current theoretical 
calculations are important since it is the largest back- 
ground to the search for the standard model Higgs boson 
via WH production with decay H — > bb [TJ [2] , to mea- 
surements of top quark properties via single [3l H] and 
pair production [5HZ] with decay t — > Wb, and to some 
searches for physics beyond the standard model |S]. 

Theoretical predictions for vector boson production 
with associated b jets have a large uncertainty. Summed 
fixed-order QCD calculations for W+bb + N- jets produc- 
tion are available for up to N=4 additional light flavor 
jets and take into account 6-quark mass effects |9 . The 
next-to-leading order (NLO) calculations for W + &-jets 
production in the 1-jet and 2-jet multiplicities show an 
enhancement over LO up to a factor of two for certain 
diagrams [10 12 . In order to minimize the impact of 
the W + 6-jets theoretical uncertainty in top quark prop- 
erty measurements and searches for WH production, the 
theoretical prediction for the cross section of W + b-jets 
production is not used in the evaluation of background 
estimates. Instead, the prediction from theory for the 
ratio of the event yields from W + b jets and W + in- 
clusive jets, corrected to match what is measured in data 
control samples, is scaled to the observed cross section 
of VF+jets in data. The systematic uncertainty on the 
W + 6-jets yield, driven by imprecise knowledge in the 
fraction of jets from b production, is approximately 30- 
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40% [IH7]. These uncertainties are very large compared 
to the small expected cross sections of the processes men- 
tioned above. We therefore wish to directly measure the 
W + 6-jet cross-section with sufficient precision to im- 
prove those background estimations. In addition, such 
a measurement will provide an important constraint on 
the theoretical predictions. Finally this measurement is 
a complement to other Tevatron measurements of vector 
boson plus heavy flavor jet production [rBTTTB] . 

In this Letter, we describe a measurement of the 6-jet 
cross section in events with a W boson in pp collisions at 
\/s=1.96 TeV from a data sample corresponding to an in- 
tegrated luminosity £=1.9 fb _1 acquired by the Collider 
Detector at Fermilab (CDF II) [5]. We select events that 
are consistent with the electronic or muonic decay of a 
W boson and contain one or two jets. Among the jets in 
these selected events, we seek those that originate from b- 
quark production. Hadrons that contain a b quark have 
a relatively long lifetime of —1.6 ps, and a large mass 
of -5.3 GeV/c 2 [H]. We exploit the B hadron's long 
lifetime by examining the charged particles within each 
jet and attempting to reconstruct a common origin for 
their trajectories that is well-displaced from the primary 
pp interaction location. The distance between the pri- 
mary and secondary vertices corresponds to the trajec- 
tory through which the relativistically boosted B hadron 
traveled during its lifetime. The technique is commonly 
known as vertex 6-tagging. 

The n tag tagged jets in the selected sample are not 
purely from b jets. Charm hadrons and certain light fla- 
vor hadrons have an appreciable lifetime, and hence jets 
containing these hadrons can be tagged despite not orig- 
inating from &-quark production. Also, the finite reso- 
lution of the CDF tracking system can allow for spuri- 
ous displaced vertices. In order to reduce contamination 
from charm and light flavor /gluon jets, the requirements 
on the quality of the secondary decay vertex have been 
optimized for this measurement. Further, we exploit the 
B hadron's large mass by examining the invariant mass 
of the charged particles forming the secondary decay ver- 
tex (vertex mass, M vert ). Vertex mass is correlated with 
the mass of the parent hadron and partially discriminates 
between the possible jet flavors to yield the &-jet fraction, 
f b . The number of b jets from other processes, n^^", is 
estimated for top quark pair, single top quark, diboson, 
and multijet production. The acceptance A\J^ is de- 
fined with respect to the restricted region of phase space 
defined below. The 6-jet identification efficiency, q? ag , 
and the event trigger efficiencies, e, are calibrated with 
data. The cross section for b jets times the branching 
fraction for one flavor of W — > tv decay is defined as 



5 



Ob jets x B(W -> tv) 



"-tag ■ f b -n 



bkR 



E (£ • ^ 

i—e,fi 



b jets 
VK+E> 



4. 



(i) 



where the sum is over the electron and muon channels. 

It is important to note that we quote our result as a jet- 
level cross section in order to avoid a model-dependent 
correction on the number of b jets per event that would 
be required to convert our result into an event-level cross 
section. Further the result is defined in a restricted re- 
gion of phase space for the kinematics and multiplicity 
of the outgoing particles in order to make comparisons 
with theoretical predictions that minimize extrapolation 
outside the experimentally accessible region. We define 
this restricted region of phase space as coincident with 
our analysis selection criteria, namely to events that con- 
tain one or two hadron-level jets with Et> 20 GeV and 
|ry| < 2.0, an electron or muon with px> 20 GeV/c and 
jryj < 1.1, and a neutrino with p T > 25 GeV/c [18]. We 
compute the theoretical predictions with these require- 
ments imposed as well. 

The data used in this measurement come from the gen- 
eral purpose CDF II detector [19] operating at Fermi- 
lab's Tevatron collider. Detailed descriptions of the vari- 
ous subdetectors new for Run II can be found elsewhere 
[2"0"H2"4"] . The data are collected with a charged lepton 
trigger that requires an electron (muon) candidate with 
M < 1.0 (1.1) and E T > 18 GeV (p T > 18 GeV/c). The 
identified charged lepton and the large missing transverse 
energy, $ T , from the undetected neutrino provide back- 
ground suppression compared to hadronic W decays. In 
offline event selection we require a single reconstructed 
electron (muon) with E T > 20 GeV (p T > 20 GeV/c) that 
is well-isolated from other activity in the calorimeter, and 
$ T > 25 GeV. A cone-algorithm-based jet reconstruction 
with cone size R = \J Ar/ 2 + A0 2 = 0.4 is used; jet re- 
construction and calibration are described elsewhere |25j . 
We require exactly one or two jets with E T > 20 GeV and 
\rj\ < 2.0. Events consistent with cosmic rays, Z — > £ + £~ , 
photon conversions, and multi-jet QCD production are 
rejected [2S]. In data from 1.9 fb -1 of integrated lumi- 
nosity, there are 175,712 events satisfying the W selection 
and jet requirements. 

The 6-tagging criteria have been designed for this mea- 
surement to obtain a significantly higher purity for b jets 
and thus reduce the overall systematic uncertainty from 
the model of the vertex mass distribution. With respect 
to the default CDF vertex 6-tagging 5], this optimized 
algorithm reduces the rate for false positives of jets from 
light quark flavors (u, d, s) and gluons by a factor of 10 
and charm by a factor of 4 at the expense of a 50% reduc- 
tion in efficiency for b jets. To be considered for 6-tagging, 
charged particle tracks within the jet cone are required 
to have pt> 0.5 GeV/c, and impact parameter signifi- 



cance 4 s - > 3.5, where the impact parameter do is the 

° d 

distance of closest approach of the particle track to the 
location of the primary pp interaction in the transverse 
plane with respect to the beam axis, and 8d a is its un- 
certainty. Particles must also have an impact parameter 
less than 0.15 cm, originate from within 2 cm of the pri- 
mary pp interaction location in the z coordinate |18j , and 
have at least a minimum number of hits from the silicon 
tracking detectors. These requirements reduce contami- 
nation from interactions with detector material, multiple 
pp interactions, and misreconstruction, respectively. A 
common decay vertex is sought among subsets of these 
selected particles, and if one is found that contains three 
or more particles, then the decay length in the transverse 
plane, L2D, is calculated as the projection along the jet 
axis of the displacement of the secondary vertex with re- 
spect to the primary pp interaction location. The vertex 
is required to have decay length significance ^ 2D > 7.5, 

° L 2D 

and pseudo-cr = L2 - p ' Mvcrt ' c < 1.0 cm, where the invari- 

C PT.vcrt 

ant mass M ver t and transverse momentum pT,vert of the 
vertex are calculated from the constituent particles. Note 
that the mass of each particle is set to the charged pion 
mass. Any vertices consistent with Kg and A decay, and 
nuclear interactions in the detector material are rejected. 
The sign of the vertex tag is determined by the position 
of the vertex with respect to the jet direction; those on 
the same (opposite) hemisphere as the jet direction are 
called positively (negatively) tagged. 

Among the events satisfying our event selection, n tag = 
943 jets are found to be positively tagged. The fla- 
vor composition of the positively tagged sample is de- 
termined through a maximum likelihood fit of the distri- 
bution of the vertex mass in the data. Simulated distri- 
butions for b and charm jets are formed from standard 
model processes that are major contributors to the se- 
lected event sample. Sources of b jets include W+b jets, 
which is simulated by the Monte Carlo event generator 
ALPGEN version 2.1.0 [27] with CTEQ5L parton densi- 
ties [25] and PYTHIA version 6.325 for hadronization |29| ; 
tt and diboson production are simulated with pythia 
version 6.216, and single top quark production by made- 
vent version 4.2.11 [30]. The yields of tagged jets from 
these processes are determined from the simulated sam- 
ples scaled to the latest theoretical cross sections [3"lTf53"] 
with the event selection requirements applied. 

We check the simulation model of b jets against an in- 
dependent data sample from double-tagged dijet events 
collected with a single pt> 9 GeV/c muon trigger. One 
jet is required to contain the muon, presumably from 
semileptonic B hadron decay. The other tagged jet in 
these events is a sample whose 6-jet purity is estimated to 
be above 99%. This sample is used to validate the model 
of the 6-jet vertex mass; the agreement between simula- 
tion and data is shown in Fig. [T] We use the difference 
between simulation and data to estimate the systematic 
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uncertainty from the 6-jet model. 

Vertex tags of jets from charm hadrons are primar- 
ily due to PF+c-jets production, which is simulated with 
ALPGEN. Positive vertex tags of light flavor jets are mod- 
eled with a simulation of inclusive jet production from 
pythia. We use negatively tagged jets in the data as 
an alternative model for light flavor. This second light 
flavor model is used in the vertex mass fit to assess the 
impact of light flavor model choice on the result. 




Mvert (GeV/c 2 ) 

FIG. 1: Comparison of the vertex mass distributions of tagged 
b jets from a simulated sample and from data. 

The maximum likelihood fit of the vertex mass data 
distribution, shown in Fig. [2j is used to extract two pa- 
rameters: the fraction of jets from bottom hadrons f b , 
and the fraction of jets from charmed hadrons f c , where 
the fraction of jets from light flavors is / LF = 1 — f b — f°. 
The best fit is f b = 0.71 ± 0.05(stat.) corresponding to 
670 ± 44(stat.) tagged jets from bottom hadrons. From 
simulated experiments with flavor compositions similar 
to the data, we confirmed that our vertex mass fit pro- 
cedure returns results consistent with the assumed back- 
ground content. These simulated experiments indicated 
that the systematic uncertainties on the model of the 6, 
charm, and light flavor vertex mass distributions man- 
ifest themselves as relative systematic uncertainties of 
0.08, 0.01 and 0.03, respectively, on the fitted 6-jet frac- 
tion. 

This yield of 6 jets includes our signal but also con- 
tains a contribution from other processes with jets from 
6-quark production. We use simulated samples and the 
theoretical predictions for production rates of tt [31] , sin- 
gle top quark [32], and diboson processes {WZ, WW, and 
ZZ) [33] in order to estimate a contribution of 152 ± 21 
b jets from these processes. This includes a small contri- 
bution of 7.3 ± 0.8 jets from W+b-jets production with 
W — > tv, which is treated as a background. Sources of 




IVUfGeV/c 2 ) 

FIG. 2: Maximum likelihood fit of the vertex mass for tagged 
jets in the selected data sample. 

systematic uncertainty in the background yield of tagged 
b jets include the uncertainty in the 6-jet tagging effi- 
ciency in the data (a relative 6% uncertainty on all tagged 
6-jet yields), the uncertainty on the top quark and dibo- 
son predicted cross sections (a relative uncertainty of 10% 
on it and diboson, and 30% on single top yields, which 
translate to an overall 2% uncertainty on erf, j Gts x B) and 
the uncertainty in the accumulated CDF luminosity (a 
relative 6% on all yields). 

We estimate a contribution of 25 ± 8 6-jets from QCD 
multijet production, where mismeasured jets pass the 
lepton identification requirements and result in sufficient 
$ T . As this background is difficult to model with simu- 
lation, a complementary data sample was collected with 
the same high j»t electron trigger, but where the electron 
candidate failed at least two of the identification crite- 
ria [33]. This provides both a model of the Jp T distribu- 
tion, which is used to estimate the rate of QCD multijet 
background above our selection |34j . and a vertex mass 
distribution, which is used to determine the fraction of 
tagged jets from bottom hadrons. The model for tagged 
jets from multijet production is statistics limited; we re- 
cover statistics by relaxing the $" T requirement and per- 
form the vertex mass distribution fit for ^ T > 15 and 
20 GeV as well as the default > 25 GeV and use all 
three results to determine the fitted b fraction from mul- 
tijet production. The uncertainties on the QCD multijet 
tagged 6-jet background come from the modeling of the 
Jfirp distribution for the overall multijet normalization (a 
relative 30% uncertainty, which translates to a 1% un- 
certainty on <7b jets x 13), and the spread in the fitted b 
fraction from vertex mass distribution fits from the dif- 
ferent $ T thresholds (a relative 25% uncertainty, which 
translates to a 1% uncertainty on <j\, j e t s X B). 
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After subtracting the background of n^^ ts = 177± 22, 
we have a yield of 493 ± 48(stat.) tagged 6 jets from 
W+b production. We define the acceptance, A^J^, of 
our selection with respect to a restricted region of kine- 
matic phase space, as defined earlier. The phase space re- 
strictions are applied to the outgoing leptonic W daugh- 
ters and jets in the simulated W+b production ALP- 
GEN events. Hadron-level jets are defined by Sparty- 
Jet [35] as a collection of simulated final state particles 
that have been clustered using the same cone algorithm 
as in the jet reconstruction. A hadron-level jet is said 
to be 6-matched if it has AR < 0.4 with respect to a 
b quark in the simulated event. The matching considers 
6-quark candidates after showering but before hadroniza- 
tion. The denominator of the acceptance is the num- 
ber of 6-matched hadron-level jets in simulated W + 6-jet 
events that pass the phase space requirements as given 
earlier; the numerator is the number of 6-matched re- 
constructed jets in simulated W + 6-jet events that pass 
the phase space requirements and, in addition, the event 
selection described above through the jet multiplicity re- 
quirement. The weighted average acceptance over the 
electron and muon channels is found to be 0.68±0.03, 
where the sources that dominate the systematic uncer- 
tainty are the jet energy calibration (3%), the factoriza- 
tion and renormalization scale (3%), and the dependence 
of event kinematics on the parton distribution functions 
(2%). 

For clarity, we separate the 6-tag efficiency and several 
data-based corrections from the acceptance. The 6-tag 
efficiency is the ratio of the number of 6-tagged recon- 
structed 6-matched jets to the number of reconstructed 
6-matched jets in the simulated W + 6-jet events that 
have passed the event selection and phase space require- 
ments: e & ta™ = 0.177 ± 0.001(stat.). This value needs to 
be corrected by a factor of 0.88±0.01(stat.)±0.05(syst.), 
which quantifies the discrepancy in tag efficiency between 
simulation and data [36] • The corrected 6-tag efficiency 
is then e^ ag — 0.156 ±0.009. The final correction factor e 
is the average over all triggers of the product of the fol- 
lowing three terms determined from data: the fraction of 
events that happen in the luminous region well-contained 
by the CDF detector, with primary pp interaction within 
60 cm of the center of the detector along the beam line, 
0.963 ± 0.003; the efficiency of the trigger, 0.943 ± 0.004; 
and the correction factor for charged lepton identification 
efficiency, 0.969 ± 0.004. 

Having obtained all of the information needed as input 
to Eq.JTJ we measure the 6-jet cross section to be Uf, j ets x 
B(W -> Iv) = 2.74 ± 0.27(stat.) ± 0.42(syst.) pb with 
a W boson decaying to a single leptonic flavor within 
the restricted kinematic phase space defined earlier. The 
overall relative uncertainty on the measurement is 18%. 
This uncertainty is dominated by the uncertainty in the 
6-jet vertex mass model (a relative 8% on <r b j 0ts x B), 
the tag efficiency (6%), and the luminosity (6%). The 



results in the electron and muon channels were examined 
independently as a cross check and are consistent. 

Finally, we have determined the theoretical prediction 
of <Jb jots x B, using our kinematic definition above, at 
leading order from pythia and at summed fixed-order 
from ALPGEN. The pythia prediction is 1.10 pb and 
the ALPGEN prediction is 0.78 pb, assuming a Q 2 scale 
of M^y +p 2 ?w ] these predictions are factors of 2.5-3.5 
lower than our result. These are important comparisons 
given the wide use of these programs in the generation of 
simulated physics events at the Tevatron and LHC exper- 
iments. A NLO calculation of at, j e ts x B has recently been 
completed [37]; their prediction of 1.22 ± 0.14(syst.) pb 
is also low with respect to the measured value. Further 
study is underway to examine the differential cross sec- 
tion as a function of jet kinematics and compare to LO, 
summed fixed-order and NLO predictions. 
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